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Abstract — We generalize a construction of non-binary quantum 
LDPC codes over ¥2^ due to |KHIK11 1 and apply it in particular 
to toric codes. We obtain in this way not only codes with 
better rates than toric codes but also improve dramatically the 
performance of standard iterative decoding. Moreover, the new 
codes obtained in this fashion inherit the distance properties 
of the underlying toric codes and have therefore a minimum 
distance which grows as the square root of the length of the 
code for fixed m. 

I. Introduction 



LDPC codes |Gal63| and their variants are one of the 
most satisfying answers to the problem of devising codes 
guaranteed by Shannon's theorem. They display outstanding 
performance for a large class of error models with a fast 
decoding algorithm. Generalizing these codes to the quantum 
setting seems a promising way to devise powerful quantum 
error correcting codes for protecting, for instance, the very 
fragile superpositions manipulated in a quantum computer. It 
should be emphasized that a fast decoding algorithm could 
be even more crucial in the quantum setting than in the 
classical one. In the classical case, when error correction 
codes are used for communication over a noisy channel, the 
decoding time translates directly into communication delays. 
This has been the driving motivation to devise decoding 
schemes of low complexity, and is likely to be important in 
the quantum setting as well. However, there is an important 
additional motivation for efficient decoding in the quantum 
setting. Quantum computation is likely to require active sta- 
bilization. The decoding time thus translates into computation 
delays, and most importantly in error suppression delays. If 
errors accumulate faster than they can be identified, quantum 
computation may well become infeasible: fast decoding is an 
essential ingredient to fault-tolerant computation. 

Quantum generalizations of LDPC codes have indeed been 
proposed in |MMM04|. However, it has turned out that the 
design of high performance quantum LDPC codes is much 
more complicated than in the classical setting. This is due 
to several reasons, the most obvious of which being that the 
parity-check matrix of quantum LDPC codes must satisfy cer- 
tain orthogonality constraints. This complicates significantly 
the construction of such codes. In particular, the plain random 
constructions that work so well in the classical setting are 
pointless here. There have been a number of attempts at over- 
coming this difficulty and a variety of methods for constructing 
quantum LDPC codes have been proposed |Pos01], [Kit03|, 



[MMM04|, IC OT051, flCOT071, HLGF06I, HGFL081, IIHI07I . 
|IM07|, ]DjoQ8| , HSMK08I , | |Aly07| , | |Aly08) , OHBD081 . 
irrZ09l . MTL101 . IIKHIK11I . However, with the exception of 
|TZ09| which gives a construction of LDPC codes with 
minimum distance of the order of the square root of the block- 
length, all of these constructions suffer from disappointingly 
small minimum distances, namely whenever they have non- 
vanishing rate and parity-check matrices with bounded row- 
weight, their minimum distance is either proved to be bounded, 
or unknown and with little hope for unboundedness. 

The point has been made several times that minimum 
distance is not everything, because there are complex decoding 
issues involved, whose behavior depends only in part on the 
minimum distance, and also because a poor asymptotic be- 
havior may be acceptable when one limits oneself to practical 
lengths. This is illustrated for instance in our case by the codes 
constructed in 1KHIK111 whose performance under iterative 
decoding is quite good even if their minimum distance might 
be bounded. Their construction can be summarized as follows. 
There are three ingredients: 



(i) 



(ii) 



The starting point is a CSS quantum code associated 
to a couple (^x^z) of binary LDPC codes satisfying 
'^z 1 ' C ^x (see Section |n| obtained from a construction 
due to [HI07|. These LDPC codes have parity check 
matrices Hx and Hz which are (2, L) -regular, meaning 
that each column contains exactly 2 "l'"s and each row 
contains exactly i-ones. 

From this construction, a pair of g-ary LDPC codes 

{^x\^ z q) ) is deduced which satisfies Igf 1 C ^\ 
where q is some power of two, q = 2 m . These codes have 
parity-check matrices and of the same size as 
Hx and Hx respectively and which have nonzero entries 
whenever the corresponding entry of Hx (respectively 
Hz) is equal to 1, that is 



„(<?) 



xi'- ^ Xij = 1, 



^ & z itj = 1, 



(1) 



where x^x^zq 
the i-th 



ing to 



l x ' 



Hx , H^ , Hz respectively. 



, Zi j denote the entry correspond- 
row and the j-th column of 



(iii) By denoting the length of e rax,'^ ? z, 



Aq) cAi) 



by 71, and 
in the finite field 

2 m , 

through a ring isomorphism A : F2™ —> A^2 m where 
A^2 m is a certain subring of binary 2 m x 2 m matrices (i.e. 



by replacing each entry of H^' and H 
¥ q over q elements by a binary matrix of size 2 m x 



a one-to-one mapping preserving field addition and multi- 
plication), a pair of two parity-check matrices (tlx, Hz) 
is obtained. They define a pair of binary codes (^x^z) 
of length nxm satisfying the CSS condition c €z C 'fix- 
The point of this construction is that the new quantum code 
associated to the pair (^x,^z) can now be decoded on 
the extension field ¥ q and this improves dramatically the 
performance in the same way as the performance of classical 
binary (2, L) regular LDPC codes is improved by moving to 
a larger extension field F 2 >™ as shown in |Hu02|, [HEA05]. 

Our purpose in this article is here to generalize the construc- 
tion of |KHIK11| and to show that it can be applied to any 
pair of binary codes i^x^z) satisfying % C ^x which 
are LDPC codes which have parity check matrices which have 
exactly 2 "l'"s per column^] not only the particular family of 
quasi-cyclic codes of this type which are constructed in |HI07|. 
We apply this generalized construction to the toric codes of 
|Kit03 1 which are a particular instance of the CSS construction 
corresponding to a pair of LDPC codes i^x^z) which are 
(2, 4)-regular. It presents the advantage of having a minimum 
distance which grows like the square root of the length but 
has also the drawback to be able to encode only 2 qubits. We 
obtain in this way a new code family which displays several 
attractive features compared to the toric code family: 

(i) it has the same two dimensional structure as toric codes, 
this might turn out to very helpful for its implementation. 
It represents for instance a quite attractive code choice for 
performing quantum fault-tolerant computation BKit03l . 

(ii) it inherits the distance properties from the underlying toric 
code and has therefore a minimum distance which grows like 
the square root of the length, 

(iii) the number of encoded qubits is not constant anymore as 
for toric codes but grows as 2m where m is the degree of the 
extension field, 

(iv) whereas iterative decoding displays very bad performances 
when applied to toric codes, plain iterative decoding behaves 
much better for this new family of codes and when m = 9 
for instance, we obtain codes for which iterative decoding 
performs quite well (see Section \V\. 

Apart from the practical relevance of the codes constructed, 
there is also a theoretical aspect. This shows for instance 
that it is possible to obtain families of CSS codes with a 
prescribed degree distribution on the check nodes with an 
unbounded minimum distance with the construction strategy 
of MKHIK111 . It is questionable whether or not the codes 
constructed in 0KHIK1 11 meet this property (one of the 
drawback of the codes constructed there is that they start with 
a certain construction of quasi-cyclic CSS codes which can be 
easily proved to have bounded minimum distance). 

II. CSS codes and Tanner graphs 

a) CSS codes: The codes constructed in this paper fall 
into the category of Calderbank-Shor-Steane (CSS) codes 
|CS96], |Ste96| which belong to a more general class of 

'in other words they are cycle codes of a graph |HB68 |. 



quantum codes called stabilizer codes [Got97 1, [CRSS98 1. The 
first class is described with the help of a pair of mutually 
orthogonal binary codes, whereas the second class is given 
by an additive self-orthogonal code over F4 with respect to 
the trace hermitian product. Quantum codes on n qubits are 
linear subspaces of a Hilbert space of dimension 2™ and do 
not necessarily have a compact representation in general. The 
nice feature of stabilizer codes is that they allow to define such 
a space with the help of a very short representation, which 
is given here by a set of generators of the aforementioned 
additive code. Each generator is viewed as an element of the 
Pauli group on n qubits and the quantum code is then nothing 
but the space stabilized by these Pauli group elements. More- 
over, the set of errors that such a quantum code can correct 
can also be deduced directly from this discrete representation. 
For the subclass of CSS codes, this representation in terms of 
additive self-orthogonal codes is equivalent to a representation 
in terms of a pair (^x,^z) of binary linear codes satisfying 
the condition c (oz C c &x- The quantum minimum distance of 
such a CSS code is given by 

def 

do = min{dx,dz}, where (2) 

d x = unn.{\x\,x&'ifx\Vz ± }, 

d z = mm{\x\,xeV z \Vx^}. 

Such a code allows to protect a subspace of kg qubits against 
errors where 

k Q = dim Ifx ~ dim ^ ■ (3) 

fco is called the quantum dimension of the CSS code. 

b) LDPC codes: LDPC codes are linear codes which 
have a sparse parity-check matrix. They can be decoded by 
using the Tanner graph associated to such a parity-check 
matrix H. This graph is defined as follows. Assume that 
H = (Hij) i<i< r is an r x n matrix (where n is the length of 

Kj<n 

the code). The associated Tanner graph is bipartite and has: 

(i) vertex set VUC, where the first set V is in bijection with 
the indices of the columns of H, say V = {1, . . . , n} and 
is called the set of variable nodes, whereas the second 
set C is called the set of check nodes and is in bijection 
with the indices of the rows of H: C = {ffii, . . . , © r }. 

(ii) edge set E; there is an edge between ©j and j if and 
only if Hij 7^ and the edge receives label in this 
case. 

A CSS code defined by a couple of binary code (^ox^z) lS 
said to be a quantum LDPC code if and only if ^x an d c &z 
are LDPC codes. 

III. A GENERALIZATION OF THE CONSTRUCTION OF 

flKHIKllll 

We show in this section how to derive for any integer m > 1 
from a pair of binary LDPC codes i^x^z) with parity-check 
matrices and Hz satisfying 



(1) h x h z t = o, 

(2) all the columns of Hx and Hz have exactly 2 "l'"s in it, 
a pair of 2 m -ary LDPC codes (^\^ g) ) with parity-check 
matrices H^' and H^' satisfying 

(1) H^H^ )T = 0, 

(2) all the columns of Hx and Hz have exactly 2 non zero 
elements in it. 

This generalizes the construction of [KHIK11] to other codes 
than the ones obtained from [HI07] by using the ring isomor- 
phism A from the finite field F2™ to M. 2 m which is described 
in Subsection E.C of KHIK111 . 

We show the existence of the couple > < ^z ) by 

providing an efficient algorithm which outputs a couple of 
matrices (H^\H^) meeting (1) and (2). To explain how 
the algorithm works let us bring in the following definition 

Definition 1: To each row k of Hz we associate a parity- 
check matrix Hx(k) consisting of the submatrix of Hx 
formed by the columns j of such that Zf.j ^ and by 
keeping only the non zero rows in it. Let Gx(k) be the Tanner 
graph associated to this parity-check matrix. 

The crucial point is the following lemma 

Lemma 1: The degree of every variable node of Gx{k) 
is two, whereas the degree of every check node is an even 
positive number. 

Proof: The fact that the degree of every variable node 
is exactly two is a direct consequence of the fact that the 
columns of Hx(fc) are all of weight 2 since the columns of 
Hx have exactly this property. The second claim about the 
degree of the check nodes is a consequence of HxH z T = 0. 
This can be verified as follows. Each check node corresponds 
to a row of Hx(fc) which corresponds itself to some row of 
Hx- We denote such a row by i. The degree of the check 
node corresponding to i is nothing but the weight of row i 
of Hx(fc). It is equal to the number of j's such that we both 
have Xi,j = Zk.j = 1. Notice that HxHz T = implies in 
particular that 

^Xi d z kd = (4) 
3 

This implies the aforementioned claim about the degree of 
the check node, since the aforementioned number of j's is 
necessarily even in order to meet (ffl. ■ 
Since the degrees of all the vertices of Gx(k) is even, 
Gx{k) can be decomposed in an edge-disjoint subset of cycles 
Cyc x (fc). Each variable node vertex j belongs to a unique 
cycle of this kind whereas a check node ©j may belong to 
several cycles of Cyc x (k). Our strategy to ensure that there 
is a choice of Hj?' and H^' meeting Condition ([Tji and 

H^'h^' 1 = is to look for solutions which satisfy for all 
rows fc of all cycles C of Cyc x (k), and all check nodes 
(Bi belonging to C 

E = ° (5) 

where we denote by £ the set of edges of C. Notice that there 
are exactly two variable nodes which are adjacent to (Bi in C. 



The first point is that the sum J2j xfj z^ can be decom- 

j (<l) (9) 

posea as a sum 2^C:C&Cyc x (k) J s>^c Z^ededge of c x i,j z k,j 
which implies that ensuring |5) implies Q and therefore 

H^H^ = 0. Moreover the code associated to the cyclic 
Tanner graph C is non trivial if and only if the product of its 
labels on its cycle is equal to 1. We define here for a Tanner 
graph the product over a cycle by 

Definition 1 (product over a cycle of a Tanner graph): 
Let C = «i, Ci, V2, ■ ■ ■ , Cfe, v\ be a cycle in the Tanner graph 
code. Then the product over this cycle is the product of all 
the coefficients of the edges over this cycle, with a power 1 
if it is a check-to-node edge, and —1 if it is node-to-check. 
We denote this product by IT(C). 

It is namely well known that 

Proposition 1: The code associated to Tanner graph which 
is a unique cycle is not reduced to the zero codeword if and 
only if the product of the labels over the cycle is equal to 1. 
In such a case, all the non-zero codewords have only non-zero 
positions. 

The proof of this proposition is given in the appendix. 

The algorithm for choosing the entries of and is 
described below as Algorithm flj The fact that the zjf^'s can 



Algorithm 1 Choosing the entries of H^' and 

Choose the entries xf^ of H^' such that for all rows k of 
and all cycles of Cyc x (fc) the product of the labels 
x\j along these cycles is equal to 1. 
for all rows k of H^' do 

for all cycles C of Cyc x (fc) do 

Choose non-zero entries zj^ for all variable nodes j 
of C such that <|3j holds for all edges of C. 
end for 
end for 



be chosen to be different from zero comes from the fact that 
the product of the labels Xij along C is equal to 1 and from 
Proposition [T] It just amounts to choose a non-zero codeword 
in the code whose Tanner graph is given by C and the labels 
of the edges are given by the x^'s. This leads to two matrices 

T 

and H^ 9) which satisfy Condition (j} and H^H^ = 
0. Finally, it remains to explain how we choose the entries 
xfj of H^. We will actually provide an algorithm which 
provides a stronger condition on the x^'s, namely that 

for all cycles C of Gx, 11(C) = 1. (6) 

The fact that the product over all cycles of Gx will be equal 
to 1 (and not only the cycles of the subgraphs Gx(k)) will be 
quite useful when applied to the toric code and this stronger 
condition can be met with Algorithm [2] which gives a very 
large choice for the coefficients. 

Proof: (of correctness of Algorithm |2]i Let C be a cycle of 
Gx- Let us prove that 11(C) = 1. This product can be written 



Algorithm 2 Algorithm to ensure |6]) 

for all check nodes Q>k of the Tanner graph Gx associated 
to H x do 

Choose arbitrarily a non zero element a k and non-zero 
elements ajk for all variable nodes j adjacent to (Bk- 
end for 

for all variable nodes j of Gx do 

x< k] *~ a kbkjhj {Here ©, and denote the two check 
nodes adjacent to j.} 
end for 



we identify the variable node sets Vx and Vz of both codes, 
say Vx — Vz — V. These graphs are defined as follows: 

V = e [0..2n - 1] x [0..2n - 1] : i + j even} 

Cx = € [0..2n - 1] x [0..2ra - 1] : i odd, j even} 

Cz = e [0..2n - 1] x [0..2n - 1] : i even, j odd} 

A check node (i, j) is connected to 4 variable nodes (i±l, j± 
1) in both graphs (where addition is performed modulo 2n). 
The degree of the variable nodes is of course 2. 

The construction, summarized on Fig [T] has the shape of a 
torus of length and width 2n. 



as 



LT(C) — II c heck nodes e fc in c/wi 



where /(fc) counts the contribution to the product which 
involves terms which depend on k. By denoting by j and 
2 the two variable nodes adjacent to ©^ in the cycle and by 
©i and ffi m the two other check nodes which are adjacent in 
the cycle to j and I respectively we can decompose f(k) as 

f(k) = g(ij)g(jk)g(kl)g(lm) 

where g(ab) gives the part of the contribution to 11(C) stem- 
ming from edge ab by keeping only elements of the product 
which depend on k. We observe now that g(ij) = &jy, 
g(jk) = a^b^j, g{kl) = a k b kl and g(lm) = b^ 1 . This 
implies f(k) = 1, which in turn implies that 11(C) = 1. ■ 
Remark: One might wonder whether or not it is possible 
to obtain gr-ary versions of Hx and Hz which satisfy the 

orthogonality condition Hy'H^' = when the columns of 
Hx and Hz have weight greater than 2. While this can be 
easily done for certain structured constructions such as the 
one proposed in [TZ09j, it is not clear how to achieve this 
in all generality. The difficulty is the following. Consider the 
code defined by a Tanner graph which is a subgraph of Gx 
labelled by a certain choice of the x\j and which consists in 



codewords of the form (z k q ^)j :Zk j= i satisfying Q. All these 

codes (for k ranging over all rows of H^ } ) should be not 
reduced to the zero codeword. While this is easily achieved 
in the case of column weight 2 essentially by the fact that the 
number of check nodes of the Tanner graphs Gx(k) is always 
less than or equal to the number of variable nodes (since by 
Lemma [T] the degree of the check nodes is greater than or 
equal to 2 and the degree of the variable nodes is constant 
and equal to 2), this is not the case anymore when the column 
weight is higher. 

IV. An application: the extended toric code 

A. Definition of the toric code and its extended version 

The toric code (see [BK98| for more details) is a CSS code 
of length 2n 2 which encodes 2 qubits. It is convenient to 
define the Tanner graphs Gx and Gz of the couple i^x^z) 
of binary codes of the CSS code simultaneously. Let Cx and 
Cz be the set of variable nodes of Gx and Gz respectively and 



(i-ljl 




Fig. 1. The Tanner graph of the toric code, with both X and Z parts 
together. The black dots represent the qubits, the dotted crosses the checks 
of the Z part, and the black crosses the checks of the X part. The left part 
is identified to the right part, and the upper part to the lower part, so that the 
global shape of the graph is a torus. 

Even if this code has as many checks as qubits, its di- 
mension is positive: the rank of Hx and Hz associated to 
Gx and Gz is n 2 — 1 instead of n 2 , thus the dimension is 
dim( < ^ x )-dim( < ^ z ± ) =n 2 + l-(n 2 -l) = 2 (from (|3)). The 
code has a rather large minimum distance [Kit03 |, however its 
performances when decoded with standard belief propagation 
is quite bad, because of the presence of many small cycles 
and also because the (classical) minimum distance of ffx and 
c £z is only 4. 

Now we construct a g-ary version of this code, in the same 



way as in Section HI In other terms, we just put some non-zero 
labels on the edges of the graph. For simplicity of notation we 
will further use Xij to design x[ 9 ^, the label in ¥ q \ {0} on 
the edge between check i and node j. Labeling is performed 

and bjk 

<4 9) ) of 



through Algorithm [T] by choosing the coefficients 
at random in Algorithm [2] We obtain a couple (^J^ 
q-ary codes satisfying 



(9)- 



C 



(<!) 



We obtain the extended toric code by applying the aforemen- 
tioned ring isomorphism to the entries of the parity-check 



matrices and of ' 



?(<?) 
•x 



and c €^ ) : the resulting code 



has length 2mn 2 . We denote the couple of binary codes 
defining this toric code by {^x^z)- 

B. Dimension 

Strictly speaking, by applying Algorithm [T] the dimension 



of -4 9) minus the dimension of < € z 



(<?)- 



could be smaller than 



dim^x - dim^. Indeed and H 



Aq) 



might now be of 

dim-4 9 ^ = n 2 . 



full rank and we might have dim' 

This would imply that dim = dim ^z and the quantum 
dimension of the extended toric code would be 0. However, 
when we apply Algorithm [2] to choose the labels (so that the 
product of the labels xi j over all cycles of Qx is equal to 1), 
then it will turn out that 



dim^ 



(?) 



x 



dim^, 



(<!)- 



dim ' 



'X 



dim ' 



so that dim^x — dim^ = 2m. This means that 

Theorem 1 (Dimension of the extended toric code): If 



, and^ 9j are constructed such that verifies (|6]l and 



2m. 



C tf- 



Ui) 



x 



then the extended toric code has dimension 



Proof: This is shown with the help of two lemmas: 

Lemma 2: If ^ verifies (|6) and ^ C 'afj^, then ^ 
verifies also |6]). 

Lemma 3: If verifies |6|, then it has g-ary dimension 
n 2 + 1. 

From these two lemmas, we obtain that the dimension of 



and c £ z 



(?) • 2 

w is n 



1, which gives 



dim%? 



(</) 



x 



dim "if, 



(<?)- 



V. Results 

We have implemented standard belief propagation over ¥2^ 
to decode extended toric codes for several values of n and to 
(see Section III of IKHIKl II ) but which correspond to the 
same final length 2mn 2 , which is 1152 here. We have chosen 

(i) m = 1, n = 24, 

(ii) to = 4, n = 12 

(iii) m = 9, n = 8. 

The channel error model is the depolarizing channel model 
with depolarizing probability p, meaning that the probability 
of an X, Y or Z error is p/3 which implies that the codes ^x 
and see a binary symmetric channel of probability 

The performance of belief propagation is quite bad in the 
binary case (that is for standard toric code), even if the qubit 
error rate is rather low, the whole error is typically badly 
estimated. On the other hand the performances get better by 
moving from F 2 to F 16 and become quite good over F 512 . 
This is remarkable since the length of these CSS codes is 
constant but the rate increases with to. For instance, the rate 
of the toric code is whereas the rate of the extended toric 
code over F512 is equal to gj. It would be interesting to carry 
over the renormalizing approach of [DCP10] which improves 
dramatically belief propagation over standard toric codes and 
study how much it is able to improve the performance of 
standard belief propagation over these larger alphabets. 



This implies that the quantum dimension of the extended toric 
code is 



dim(^x) - dim(^z ) = m(dim^ x 3) - dim^ ) = 2to 



The proof of the two lemmas is given in the appendix. 

C. Minimum distance 

Choosing the product of the labels to be equal to 1 on all 
cycles of Qx brings another benefit : it allows to control the 
minimum distance, since we have in this case 
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probability on the depolarizing channel 



Lemma 4: min{|x| G \ ^z" } ~ min{|x| € ^ z q> \ pig. 2. Word and qubit error rates for several extended toric codes. 

tfj?^} = n. 

The proof is given in the appendix. This implies that 

Theorem 2 (minimum distance of the extended toric code): 
The minimum distance of the extended toric code is > n. 
Proof: The minimum distance of the extended toric code 

is the minimal weight of a word from 'rfxX'&z or 'tfzX'&x ■ 
The Hamming weight of such a word is greater than or equal 

to the Hamming weig ht of the word in ^ \ tf z q) or ^ z q) \ 

^ corresponds to after taking the aforementioned ring 
isomorphism A. ■ 
Remark 1: There is also an upper bound on the minimum 
distance: it is at most nm, since a word of weight n in ¥ q has 
minimal weight n and maximal weight mn in F 2 . 



=>(<?)- 



Appendix 

Proof of Proposition [7} Let us consider a Tanner graph 
composed of a cycle Vq, Cq,V\, C\, . . . Vf--i, Cfc-i, and let 
Xij be the label on the edge between check i and node j. 
A codeword WoWi, . . -Wk-i of the code associated to this 
Tanner graph is such that: 

XQflWo + I ,lWl = 
XlAWx + XioW 2 = 



Xk-i,k-iw k -i + x k - ly0 w = 



This system has non-trivial solutions if and only if the deter- 
minant of this system is 0, ie if: 



A 



■ Xk-l,k-l + xo,i ■ 



■ Xk-1,0 



xo,ox \ 



■ Xk-l t k-l x k-l,0 



which means that the product over the cycle is 1. 

If this condition is verified, and one of the Wi's is zero, for 
example Wq, we can see from the system that wi, . . .Wk-i 
have to be equal to zero too. So the non-zero codewords have 
only non-zero positions. ■ 
Proof of Lemma |2j 

We consider here two basic types of cycles in the Tanner 
graphs of Ifjp and : the minimal cycles of length 8, and 
cycles of length 2n that go through the length or the width of 
the torus, we call the last ones "big cycles". An example is 
shown on Fig [3] 




is a 



Fig. 3. Two big cycles (horizontal, vertical) and a minimal cycle. 

More formally, 

Definition 2 (minimal cycle): A minimal cycle in the Tan- 
ner graph of or is a cycle of the form: (i + 

(t + 1, j - 1), (i + 2), - 2), (i -l,j- 2), (t - 

1, 3 ~ 1)) (i - with i + 3 even > so that {hi) is a 

variable node. 

Definition 3 (Big cycle): A horizontal big cycle in the Tan- 
ner graph of 'tfjp or is a cycle of the form: (i + 
l,j),...(i + 2n- with i+j even. F] 
A vertical big cycle in the Tanner graph of ^yOr 
cycle (i,j + l), . . . (i,j+2n-l), with i+j even. 

Our first observation is that it is enough to prove Condition 
(|6) on the minimal cycles and the big cycles of the Tanner 
graph of ^2 , smce the product of any other cycle in this 
Tanner graph can be decomposed as a product of products 
over these basic cycles. 

Let us now consider a 4-cycle which lives in the union 
of the two Tanner graphs of 'tfjp and If^. It consists in 
two checks (see Fig [4} A and B, that are both connected to 
two variable nodes 1 and 2. From the orthogonality constraint 

H^H^ = 0, we deduce that the labels on the edges of 
this cycle satisfy 



XA.lZBA + X A ,2ZB,2 = 



recall that addition on the indices is performed modulo In. 



© 



XA,2 



XA,1 1 
• 

zba 



2 Z B ,2 y ~ y 
Fig. 4. A 4-cycle. The black cross A is a check from 

B is one from ^"1 , an d the dots 1 and 2 are the qubits where they interact 



the dotted cross 



We can reformulate this: 



X A,\Z B ,\Z q 2 X A.2 



= 1 



With the following definition, we obtain in this way that the 
product over such cycles of size 4 is equal to 1, 

Definition 4 (Product over a cycle - extended version): 
The notion of product over a cycle can be extended to 



the union of the Tanner graphs of ^ and Kg 1 . If 
V\, Ci, «2, • • • , Cft, V\ is a cycle in this union, the product over 
this cycle is the product of all the labels of the edges over 
this cycle, with a power: 

• 1 if the edge is check-to-variable node and belongs to the 
X-part, 

• —1 if the edge is variable node-to-check and belongs to 
the X-part, 

• —1 if the edge is check-to-variable node and belongs to 
the Z-part, 

• 1 if the edge is variable node-to-check and belongs to the 
Z-part, 

Now, let us look at a combination of 4 such small cycles, 
as in Fig [5] 



*(9) 



.4 



ZAA\ 

4 • 



ZA,1 



XEA 



ZB,U 



XEA 



XE.2 



D 



%E,3 



ZB,2 

• 2 

ZC,2 



' -M 

ZC,3'vTV C 



Fig. 5. A view of a X check (E) with the four related Z checks around 
(A, B, C, D). 

The product over all small cycles is 1 : 

z aa x E A x E A Z AA = 1 
z B,l Z B,2 X E,2 X E,l = 1 
XE,2ZC,2Zc i3 Xg t3 = 1 

X E,4 Xe ,3 Z D,3 Z D,4, = 1 

By multiplying all these equations, we obtain: 

ZAA z AA z BA z B^2 Z C,2Z c ^ i ZE^Z D ^ i = 1 

which is exactly the product over a minimal cycle of ■ 

Now, we consider another combination of 2n 4-cycles such 
as in Fig |4] among one direction of the torus, as shown in 
Fig [6] It consists, in the subgraph of both Tanner graphs, in 
the variable and check nodes in the cartesian product [0..2n — 



1] x {0,1}. To simplify notation we have relabeled a variable 
node (i, 0) by | + n, a variable node (i, 1) by a check 
node (i, 1) corresponding to by | and a check node (z, 



corresponding to also by 



It is summarized in Fig 



'X\ I 



^0,0 



Xl,r, 

' i x 

Fig. 6 



+ 1 




Cx2 2 



n+2 



e 

•i'.i 



n+3 



1 Czl n + 2 C Z 2 71 ^ 



An ensemble of small cycles of (black) and (dotted). 




■e— © 



The product over all such cycles is 1, ie: 



X\ \Z\ iz in + l~ 1 x~ 1 



1 

l,n+l 



•^n— l.n— l^n— l,n— — i,2n— l,2n— 1 



= 1 



^0 n — l^O.n^n- l.n^n — l,n— 1 



1 



By multiplying all these equations, we get: 

x n-l,n-l z n -i2n-l X Q,n-l Z n-l,n = 1 

(xrj, 0^1.0 ■ • ■ x n— l,n— l x 0,n— l) X 
( Z 0,n Z 0,?i+l • • • Z n-l,2n-l Z n-l,n) = 1 

The first parenthesis is the product over a big cycle of^\ 
and the second parenthesis is the product over a big cycle of 

It shows that if the product over a big horizontal cycle is 
equal to 1 in , then the product over a big horizontal 
cycle in is also equal to 1. There is a similar proof for 
the vertical cycles. ■ 
Proof of lemma ^ 

First, we show that the dimension of ^jf 1 is at least n 2 + 1. 

The idea is to construct a set of independent codewords 
associated to cycles of the Tanner graph of c <a^\ This is 
obtained as follows. Since all variable nodes of this Tanner 
graph have degree 2, we can consider the graph of the checks, 
where the vertices are the checks, and there is an edge between 
two vertices if and only if there is a variable node that is 
adjacent to the two checks. Informally, it just consists of the 
same graph where an "edge-variable node-edge" is replaced 
by a single edge. We consider a spanning tree of this graph. 
An example of such spanning tree is shown in Fig [7] 

This spanning tree has of course n 2 checks, and therefore 
n 2 — 1 edges between these checks. There are n 2 + 1 other 
edges: let e 1; . . . e„2 +1 be such edges. For all i, adding 
to the spanning tree provides a unique cycle, Cj. Let c'j be 
the corresponding cycle in the original Tanner graph. Now, 
the product over each such cycle is 1. From Proposition [T] 
each of these cycles provides a codeword of ^ ■ These 



Fig. 7. A spanning tree (black) of the graph of the checks. The Tanner 
graph of the toric code is shown in grey. 



positions which correspond to the edges ei, . . . e„2 +1 , exactly 
one of these codewords has a non zero entry (for the edge e, 
it is precisely c[ which has a non zero entry for this position). 

To show that this dimension is at most n 2 + 1, we remove 
a certain check, say check c r . We want to show that the 
remaining n 2 — 1 checks are independent. To obtain this, we 
prove that for any syndrome, we can construct an error that 
gives this syndrome. In particular, we show that for every 
check Co, we can get the syndrome (0, ... 0, 1, 0, ... 0) with 1 
at position cq. 

Let cq , v% , ci , i>2, . . . Ufe , c r be some path in the Tanner graph 
that links cq to c r . An example of such path is shown in Fig [8] 




Fig. 8. A path between the removed check c r (grey) and some check cq 
(bold). 

Now we construct an error E that has in every position 
except the tij's: 

• E Vl is such that the syndrome in c is 1, ie E Vl — 1/xq i 



E V2 such that c\ has syndrome 0, ie E V2 
so on and so forth. 



E v , and 



Since c r has been removed, all the checks except cq are 
satisfied. ■ 
Proof of Lemma Wj 
Consider an element E of minimal weight in the set 

1*1° x ■ ^ e are S om § to P rove th at its weight is greater 
than or equal to n. A similar proof shows that this is also the 



case for the minimal weight elements of ^ x 
proves the lemma 



(9) lc*(q) 



± 



and this 



1 



1 codewords are necessarily independent, since for all the n 2 + 1 and the dimension of is n 2 ~l. Then the quotient 



From Lemma |3| we know that the dimension of 

(?)-' 



is 



AqY 
7 x 



has dimension 2, consequently we just need to 
find two independent codewords X\ and X 2 G ^zf* l^x^ > 



«i X\ + a 2 X 2 , 
and at least one of either a\ 



and any such E can be written as E = E s 

with a 1 ,a 2 £ ¥ q ,E s £ ^ , 
or Qf2 should be non zero. 

We claim that we can choose X\ to be a codeword provided 
by a big vertical cycle of the Tanner graph of (obtained 
from Proposition [TJ, and X 2 being defined similarly with a 
big horizontal cycle. We also define Z\ and Z 2 , provided by 
respectively a big horizontal cycle and a big vertical cycle of 
<o x . 

We notice that the following inner product is non zero 

x 1 z 1 T ± 

since there is only one coordinate where both X\ and Z\ are 
not zero. 

Note that X\ belongs to c ^ by definition and X x > 
otherwise X\ would have been orthogonal to all words of 
, including Z\. We have the same result for X 2 (and Z 2 ). 
We finally just need to prove that they are independent: 

Assume that X\ — aX 2 + E s , with a in V q \ {0} and E s 
in . Then we would have 



X\Z\ = aX 2 Zi + E S Z\ 

- - T 

= aX 2 Z\ 

since E a Z~x = because Z x £ Sfj? 5 and £ s £ ^ . 

The left part is non zero, and the right part is zero, since 
the supports of X 2 and Z\ are disjoint. This leads to a 
contradiction. 

Now assume E is of the form E = aiXi+a 2 X 2 +E s with 
either a± or a 2 being different from 0. We want to show that 
this error is of weight at least n. Assume now that a.\ 7^ 0. 

We introduce n shifts of Z~±: Z± , for all i even, i £ 
{0, . . . 2n — 2} which is the codeword provided by the cycle: 
(0, i), (1, i), . . . (2n — l,i),(0,i). They are just horizontal 
cycles, at different "heights", as shown in Fig [9] 

We have 

E s {Z 1 i ) T = Q 



since Z\ £ 



and E s £ r t 



and 



Xx{Z 



Notice that we have for all i: 



= atX^Y + a 2 X 2 {Z x °) 
= ai Xi(Zi) T 
+ 



This implies that for all i, E has at least a non-zero coordinate 
on the support of Z\ . Since all Zi*'s have disjoint support, 
it shows that E has at least n non-zero coordinates. A similar 
reasoning holds in the case a 2 ^ by multiplying by Z 2 this 
time. 




Fig. 9. The toric code (only the Z-part is displayed) of length 6 with X\ 
and several variants of Z\ 
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